Alleviating Search Uncertainty Through Concept Associations: Automatic Indexing, Co-Occurrence Analysis, and Parallel Computing

نویسندگان

  • Hsinchun Chen
  • Joanne Martinez
  • Amy Kirchhoff
  • Tobun Dorbin Ng
  • Bruce R. Schatz
چکیده

In this article, we report research on an algorithmic apgather, process, and retrieve information. These systems proach to alleviating search uncertainty in a large inforprovide a wide variety of information and services, rangmation space. Grounded on object filtering, automatic ing from daily updates of foreign and national news, indexing, and co-occurrence analysis, we performed a movie reviews and clips, law cases, and financial data large-scale experiment using a parallel supercomputer on companies to journal articles, books, trademarks, and (SGI Power Challenge) to analyze 400,000/ abstracts in an INSPEC computer engineering collection. Two sysstatistics. However, gaining access to such information is tem-generated thesauri, one based on a combined oboften difficult. This is due, in large part, to the indeterminject filtering and automatic indexing method, and the ism involved in the process by which information is inother based on automatic indexing only, were compared dexed, and to the latitude searchers have in expressing a with the human-generated INSPEC subject thesaurus. query. Our user evaluation revealed that the system-generated thesauri were better than the INSPEC thesaurus in concept recall, but in concept precision the 3 thesauri were 2. Using Thesauri to Alleviate Search comparable. Our analysis also revealed that the terms suggested by the 3 thesauri were complementary and Uncertainty: Literature Review could be used to significantly increase ‘‘variety’’ in search terms and thereby reduce search uncertainty. 2.1. Indexing and Search Uncertainty

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Parallel Computing Approach to Creating Engineering Concept Spaces for Semantic Retrieval: The Illinois Digital Library Initiative Project

This research presents preliminary results generated from the semantic retrieval research component of the Illinois Digital Library Initiative (DLI) project. Using a variation of the automatic thesaurus generation techniques, to which we refer as the concept space approach, we aimed to create graphs of domain-speciic concepts (terms) and their weighted co-occurrence relationships for all major ...

متن کامل

A Parallel Computing Approach to Creating Engineering Concept Spaces for Semantic Retrieval: The Ill - Pattern Analysis and Machine Intelligence, IEEE Transactions on

This research presents preliminary results generated from the semantic retrieval research component of the Illinois Digital Library Initiative (DLI) project. Using a variation of the automatic thesaurus generation techniques, to which we refer as the concept space approach, we aimed to create graphs of domain-specific concepts (terms) and their weighted co-occurrence relationships for all major...

متن کامل

Co-occurrence of the Concepts Hikmah and Kitab in the Quran and the Generation of a New Concept

Kitab and hikmah are two concepts discussed in the commentaries of the Quran. The main purpose of the present article is the analysis of the concept hikmah when it co-occurs with kitab in the holy Quran. It is currently claimed that the concept hikmah has a different meaning from the lexical and idiomatic hikmah in the Quran in cases where it co-occurs with kitab. The research method in the pre...

متن کامل

Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming

The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations because the rapid development in the use of information technology in general and network technology in particular, has led to the trend of many organizations to make their applications available for use via electronic platforms hos...

متن کامل

Journal of Emerging Trends in Computing and Information Sciences::Automatic Learning Context Tool for Effective Personal Document Indexing and Retrieval

Managing digital documents has become a time consuming process due to sheer scale. Most users manage their personal documents by creating logical hierarchical folder structures. This logical structure depends on the user’s assessment of the context of the document. Basic file structuring has not been changed for decades and hierarchical file structure remains the same. But there has been a surg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JASIS

دوره 49  شماره 

صفحات  -

تاریخ انتشار 1998